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GENETIC MARKER FOR MEAT QUALITY, GROWTH, CARCA.q.q Aim 



FIELD OF THE INVENTION 

This invention relates generally to the detection 
of genetic differences associated with growth, body 
composition and reproductive traits among livestock. 
More specifically, the invention provides compositions 
and methods for predicting heritability of certain 
traits related to steroid biosynthesis and metabolism. 

BACKGROUND OF THE INVENTION 

Several publications are referenced in this 
application by author name, year and journal of 
publication in parentheses in order to more fully * • 
describe the state of the art to which this invention 
pertains. The disclosure of each of these publications 
is incorporated by reference herein. 

Steroid hormones play a crucial role in the 
differentiation, development , growth and physiological 
function of most animal tissues. The first and rate- 
limiting step in the biosynthesis of all steroid 
hormones is the conversion of cholesterol into 
pregnenolone by the cholesterol side chain cleavage 
enzyme p45 0scc- The gene which encodes P45 0scc is 
termed CYPllal. Cytochromes P450 are a diverse group of 
heme- containing mono-oxygenases (termed CYP's; see 
Nelson et al. , DNA Cell. Biol. (1993) 12: 1-51) that 
catalyze a variety of oxidative conversions, notably of 
steroids but also of fatty acids and xenobiotics. CYP*s 
are most abundantly expressed in the testis, ovary, 
placenta, adrenal glands and liver. In the reproductive 
organs, such as testis, ovary and placenta, the most 
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important steroid hormones produced are the androgens 
(e.g., testosterone); the estrogens (e.g., estradiol), 
and progestins (e.g., progesterone). In the adrenal 
glands, the most important steroids are the 
5 mineralcorticoids (e.g., aldosterone) and the 

glucocorticoids . (e.g. , Cortisol) . 

The frequent occurrence of off-odors or off-tastes 
in cooked pork from boars, commonly known as "boar odor" 
or "boar taint", is the primary reason for the common 

10 practice of castration in swine production. 5a~ 

androstenone ( 5a~androst-15-en~3 -one) , an important 
compound responsible for boar taint, is synthesized in 
the boar testis along with other 16-androstene steroids, 
androgens, and estrogens. At puberty, testicular 

15 production of A16-androstenes , in particular 5a- 

androstenone (androstenone) , increases sharply. This 
results in the accximulation of andr'ostendne in various 
body compartments, notably in fat deposits throughout 
the body and in the submaxillary salivary gland (SMG), 

20 where there is a specific binding protein for A16- 

androstenes. Concentration of androstenone and other 
A16-androstenes in the SMG are highly correlated with 
concentrations of A16- androstenes in the fat. 
Measurement of Al6-androstenes in the SMG is used, in 

25 fact, as a test method to determine the presence or 

absence of boar taint. Thus, due to this increase in 
A16-androstenes , it is common in the industry to 
castrate the young male boars to minimize this taint in 
the meat. However, if the problem of boar taint were 

30 overcome, raising boars rather than raising castrates 

(barrows) for pork would have considerable economic 
advantages. Although boars and barrows gain weight at 
equivalent rates, boars produce carcasses containing 2 0- 
3 0% less fat. Thus, boars are much more efficient at 

35 producing lean muscle. In addition, boars utilize feed 
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more efficiently than barrows (10% less feed consumed 
per unit of body weight) . Since feed represents the- 
major cost in swine production, raising boars for pork 
would have significant economic advantages. 

In the United States, approximately 9 0 million hogs 
are slaughtered annually with an approximat^e value of 
$11 billion. Feed accounts for the major portion of the 
costs of swine production, accounting for roughly 70% of 
production costs. Thus, a 10% improvement in feed 
efficiency would produce savings of 7% of the total cost 
of production. On a nation -wide basis, considering male - 
swine only, this translates to total market savings of 
$335 million. The loss of production efficiency caused 
by the practice of castration represents a very large 
economic loss to the swine industry throughout the 
world. 

Identification of the inheritance pattern (s) and 
genetic bases for alterations in steroid biosynthesis in 
livestock has utility in the production of meat, dairy 
and egg products of higher quality. It is an object of 
the present invention to provide compositions and 
methods for identifying such genetic alterations. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, methods 
for identifying genetic alterations associated with 
steroid biosynthesis are provided. In one embodiment of 
the invention, the presence or absence of a polymorphic 
marker in the CYPllal DNA of a test subject is 
determined. Such test subjects are selected from 
important livestock species, including without 
limitation, pigs, cows, chickens and sheep. In 
accordance with the present invention, it has been 
determined that certain polymorphisms in the CYPllal 
gene are associated with increased growth, reproductive 
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and carcass traits. Thus, screening methods are 
provided for identifying those test subjects which 
possess these beneficial CYPllal alleles. 
Identification of such livestock facilitates the 
implementation of breeding programs for developing stock 
having these improved genetic traits. 

As is well known to those of skill in the art, a 
variety of techniques may be utilized when comparing 
nucleic acid molecules for sequence differences. These 
include by way of example, restriction fragment length 
polymorphism analysis, heteroduplex analysis, single 
strand conformation polymorphism analysis, denaturing 
gradient electrophoresis and temperature gradient 
electrophoresis. 

In a preferred embodiment of the invention, the 
CYPllal polymorphism is a restriction fragment 
polymorphism and the assay comprises identifying the 
CYPllal gene from genetic material isolated from the 
test subject; exposing the gene to a restriction enzyme 
that yields restriction fragments of the gene of varying 
length; separating the restriction fragments to form a 
restriction pattern, such as by electrophoresis or HPLC 
separation; and comparing the resulting restriction 
fragment pattern from a test subject CYPllal gene that 
is either known to have or not to have the desired 
marker. If a test subject tests positive for the marker, 
such a subject can be considered for inclusion in the 
breeding program. If the test subject does not test 
positive for the marker genotype, the test subject can 
be culled from the group and otherwise used. 

In a particularly preferred embodiment, the test 
subject is a pig, the polymorphism is in the 5 ' UTR of 
the CYPllal gene and the restriction enzyme is SphI . 
Thus, in this aspect, it is an object, of the invention 
to provide a method of screening pigs to determine those 
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more likely to have decreased testis weight and reduced 
boar taint, longer carcasses, improved rate of gain, or 
heavier weaning weights when bred to or to select 
against pigs which have alleles indicating larger testis 
size, increased boar taint, reduced carcass length, 
lower rate of gain, or lighter weaning weights. As used 
herein ''smaller testis size" means a significant 
decrease in testis size below the mean for a given 
population. As used herein "reduced boar taint" means a 
significant decrease in boar taint below the mean for a 
given population. As used herein ''increased carcass 
length" means a significant increase in carcass length 
above the mean for a given population. As used herein 
"higher rate of gain" means a significant increase in 
rate of gain above the mean for a given population. As 
used herein "heavier weaning weights" mean an increase 
in weaning weight above the mean for "a" given population. 
The method of the invention comprises the steps: 1) 
obtaining a sample of genomic DNA from a pig; and 
2) analyzing the genomic DNA obtained in 1) to determine 
which CYPllal allele (s) is/are present. Briefly, a 
sample of genetic material is obtained from a pig, and 
the sample is analyzed to determine the presence or 
absence of a polymorphism in the CYPllal gene that is 
correlated with reduced boar taint, smaller testis size, 
increased carcass length, higher rate of gain, and/or 
increased weaning weight . 

In a most preferred embodiment the gene is isolated 
by the used of primers and DNA polymerase to amplify a 
specific region of the gene which contains the 
polymorphism. Next the amplified region is digested with 
a restriction enzyme and fragments are separated. 
Visualization of the RFLP pattern is by simple staining 
of the fragments, or by labeling the primers or the 
nucleoside triphosphates used in amplification. 
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In another embodiment, the . invent ion comprises a 
method for identifying a genetic marker for boar taint, 
testis size, carcass length, rate of gain, and/or 
weaning weight in a particular population. Male and 
female pigs of the same breed or breed cross or similar 
genetic lineage are bred, and traits such as boar taint, 
testis size, carcass length, rate of gain, and/or 
weaning weight are determined. A polymorphism in the 
CYPllal gene of each pig is identified and associated 
with the traits of boar taint, testis size, carcass 
length, rate of gain, and/or weaning weight. Preferably, 
RFLP analysis is used to determine the polymorphism, and 
most preferably, the DNA is digested with the. 
restriction endonuclease SphI , or other restriction 
endonuclease that differentially cleaves the restriction 
site based on the presence or absence of the 
polymorphism. 

Methods are also provided to establish linkage 
between specific alleles of alternative DNA markers and 
alleles of DNA markers known to be associated with a 
particular gene (e.g. the CYPllal gene discussed 
herein) , which have been previously shown to be 
associated with a particular trait. Thus, selection for 
pigs likely to have reduced boar taint, smaller testes, 
increased carcass length, higher rate of gain, and/or 
heavier weaning weights, or alternatively to select 
against pigs likely to have increased boar taint, larger 
testes, reduced carcass length, lower rate of gain, 
and/or lighter weaning weights, may be done indirectly, 
by selecting for certain alleles of a CYPllal associated 
marker through the selection of specific alleles of 
alternative markers located on the same chromosome as 
CYPllal. 

The invention further comprises kits for evaluating 
a sample of test subject DNA for the presence in test 
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subject genetic material of a desired marker located in 
the test subject CYPllal gene indicative of the 
inheritable traits of boar taint (in the' pig), testis 
size, carcass length, rate of gain, and/or weaning 
5 weight. At a minimum, using the pig as the test subject, 

the kit is a container with one or more reagents that 
identify a polymorphism in the pig CYPllal gene. 
Preferably, the reagent is a set of oligonucleotide 
primers capable of amplifying a fragment of the pig 

10 CYPllal gene that contains the polymorphism. More 

preferably, the kit further contains a restriction 
enzyme that cleaves the pig CYPllal gene in at least one 
place. In a most preferred embodiment the restriction 
enzyme is SphI or one which cuts at the same recognition 

15 site. 

The following definitions are provided to 
facilitate an understanding of the present invention: 
The term "corresponds to" is used herein to mean 
that a polynucleotide sequence- is homologous to all or a 

20 portion of a reference polynucleotide sequence, or that 

a polypeptide sequence is identical to a reference 
polypeptide sequence. In contradistinction, the term 
"complementary to" is used herein to mean that the 
complementary sequence is homologous to all or a portion 

25 of a reference polynucleotide sequence. For 

illustration, the nucleotide sequence "TATAC" 
corresponds to a reference sequence "TATAC" and is 
complementary to a reference sequence "GTATA" . 
Hybridization probes may be DNA or RNA, or any synthetic 

30 nucleotide structure capable of binding in a base- 

specific manner to a complementary strand of nucleic 
acid. For example, probes include peptide nucleic acids, 
as described in Nielsen et al . , Science 254:1497-1500 
(1991). 

35 
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Linkage" describes the tendency of genes, alleles, 
loci or genetic markers to be inherited together as a 
result of their location on the same chromosome, and, is 
measured by percent recombination (also called 
recombination fraction, or 9) between the two genes, 
alleles, loci ■ or genetic markers. The closer two loci 
physically are on the chromosome, the lower the 
recombination fraction will be. Normally, when a 
polymorphic site from within a disease-causing gene, is 
tested for linkage with the disease, the recombination 
fraction will be zero, indicating that the disease and 
the disease-causing gene are always co-inherited. In 
rare cases, when a gene spans a very large segment of 
the genome, it may be possible to observe recombination 
between polymorphic sites on one end of the gene and 
causitive mutations on the other. However, if the 
causative mutation is the polymorphism being ' tested for 
linkage with the disease, no recombination will be 
observed. - 

"Centimorgan" is a unit of genetic distance 
signifying linkage between two genetic markers, alleles, 
genes or loci, corresponding to a probability of 
recombination between the two markers or loci of 1% for 
any meiotic event . 

"Linkage disequilibrium" or "allelic association" 
means the preferential association of a particular 
allele, locus, gene or genetic marker with a specific 
allele, locus, gene or genetic marker at a nearby 
chromosomal location more frequently than expected by 
chance for any particular allele frequency in the 
population . 

An "oligonucleotide" can be DNA or RNA, and single- 
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or double-stranded. Oligonucleotides can be naturally 
occurring or synthetic, but are typically prepared by 
synthetic means. - 

The term "primer" refers to an oligonucleotide 
capable of acting as a point of initiation of DNA 
synthesis under conditions in which synthesis of a 
primer extension product complementary to a nucleic acid 
strand is induced, i.e., in the presence of four 
different nucleoside triphosphates and an agent for 
polymerization (i.e., DNA polymerase or reverse 
transcriptase) in an appropriate buffer and at a 
suitable temperature. A primer is preferably a single- 
stranded oligonucleotide. The appropriate length of a 
primer depends on the intended use of the primer but 
typically ranges from 15 to 30 nucleotides. Short primer 
molecules generally require cooler temperatures to form 
sufficiently stable hybrid complexes with the template. 
A primer need not reflect the exact sequence of the 
template but must be sufficiently complementary to 
hybridize with a template. The term "primer" may refer 
to more than one primer, particularly in the case where 
there is some ambiguity in the information regarding one 
or both ends of the target region to be amplified. For 
instance, if a region shows significant levels of 
polymorphism or mutation in a population, mixtures of 
primers can be prepared that will amplify alternate 
sequences. A primer can be labeled, if desired, by 
incorporating a label detectable by spectroscopic, 
photochemical, biochemical, immunochemical, or chemical 
means. For example, useful labels include ^^P, 
fluorescent dyes, electron-dense reagents, enzymes (as 
commonly used in an ELISA) , biotin, or haptens and 
proteins for which aritisera or moriocloiial antibodies are 
available. A label can also be used to "capture" the 
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primer, so as to facilitate the immobilization of either 
the primer or a primer extension product, such as 
amplified DNA, on a solid support. 

"Chromosome 7 set" in boars for example, means the 
two copies of chromosome 7 found in somatic cells or the 
one copy in germ line cells of a test subject or family 
member. The two copies of chromosome 7 may be the same 
or different at any particular allele, including alleles 
at or near the locus of interest. The chromosome 7 set 
may include portions of chromosome 7 collected in 
chromosome 7 libraries, such as plasmid, yeast, or phage 
libraries, as described in Sambrook et al . , Molecular 
Cloning, 2nd Edition, and in Mandel et al . , Science ~ 
258:103-108 (1992) . 

"Penetrance" is the percentage of individuals with 
a defective gene or polymorphism who show some symptoms 
of a trait resulting from that genetic alteration. 
Expressivity refers to the degree of expression of the 
trait (e.g., mild, moderate or severe). 

"Polymorphism" refers to the occurrence of two or 
more genetically determined alternative sequences or 
alleles in a population. A polymorphic marker is the 
locus at which divergence occurs. Preferred markers have 
at least two alleles, each occurring at frequency of 
greater than 1%. A polymorphic locus may be as small as 
one base pair difference. Polymorphic markers suitable 
for use in the invention include restriction fragment 
length polymorphisms, variable number of tandem repeats 
(VNTR's), hypervariable regions, minisatellites , 
dinucleotide repeats, trinucleotide repeats, 
tetranucleotide repeats, and other microsatellite 
sequences . 
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"Restriction fragment length polymorphism" (RFLP) 
means a variation in DNA sequence that alters the length 
of a restriction fragment as described in Botstein et 
al.. Am. J. Hum. Genet. 32:314-331 (1980). The 
restriction fragment length polymorphism may create or 
delete a restriction site, thus changing the length of 
the restriction fragment. For example, the DNA sequence 
GAATTC are the six bases, together with its 
complementary strand CTTAAG which comprises the 
recognition and cleavage site of the restriction enzyme 
EcoRI. Replacement of any of the six nucleotides on 
either strand of DNA to a different nucleotide destroys 
the EcoRI site. This RFLP can.be detected by, for 
example, amplification of a target sequence including 
the polymorphism, digestion of the amplified sequence 
with EcoRI, and size fractionation of the reaction 
products on an agarose or acrylamide gel. If the only' 
EcoRI restriction enzyme site within the amplified 
sequence is the . polymorphic site, the target sequences 
comprising the restriction site will show two fragments 
of predetermined size, based on the length of the 
amplified sequence. Target sequences without the 
restriction enzyme site will only show one fragment, of 
the length of the amplified sequence. Similarly, the 
RFLP can be detected by probing an EcoRI digest of 
Southern blotted DNA with a probe from a nearby region 
such that the presence or absence of the appropriately 
sized EcoRI fragment may be observed. RFLP ' s may be 
caused by point mutations which create or destroy a 
restriction enzyme site, VNTR's, dinucleotide repeats, 
deletions, duplications, or any other sequence-based 
variation that creates or deletes a restriction enzyme 
site, or alters the size of a restriction fragment. 

"Variable number of tandem repeats" (VNTR's) are 
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short sequences of nucleic acids arranged in a head to 
tail fashion in a tandem array, and found in each 
individual, as described in Wyman et al,, Proc . Nat . 
Acad. Sci. 77:6754-6758 (1980). Generally, theVNTR 
sequences are comprised of a core sequence of at least 
16 base pairs, with a variable number of repeats of that 
sequence. Additionally, there may be variation within 
the core sequence, Jeff'erys et al., Nature 314:67-72 
(1985) . These sequences are highly individual, and 
perhaps unique to each individual . Thus, VNTR's may 
generate restriction fragment length polymorphisms, and 
may additionally serve as size-based amplification 
product differentiation markers. 

"Microsatellite sequences" comprise segments of at 
least about 10 base pairs of DNA consisting of a 
variable number of tandem repeats of short (1-6 base 
pairs) sequences of DNA(Clemens et al . , Am. j. Hum. 
Genet. 49:951-960 1991). Microsatellite sequences are 
generally spread throughout the chromosomal DNA of an 
individual. The nximber of repeats in any particular 
tandem array varies greatly from individual to 
individual, and thus, microsatellite sequences "may serve 
to generate restriction fragment length polymorphisms, 
and may additionally serve as size-based amplification 
product differentiation markers. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts the sequence of approximately 63 0 
base pairs of the 5 ' untranslated region of the porcine 
CYPllAl gene (SEQ ID NO : 1) . The PGR fragment was 
produced using DNA extracted from porcine testis 
samples. The primers iised were forward primer (SEQ ID 
NO: 2) and reverse primer (SEQ ID N0:3). 
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Figure 2 depicts the polymorphic pattern of Sphl 
digested PGR product. The forward and reverse primers 
were used in the following PGR conditions : Two minutes & 
94°C, 35 cycles of one minute @ 94°C, one minute @ 55°C, 
one minute @ 72 °C and a final two minutes © 72 "'C. 
Samples were digested with Sphl (New England Biolabs) 
and separated on 1.5% agarose gel at 50 volts for 45 
minutes at room temperature. Gels were stained with 
ethidium bromide. Lane 1: low molecular weight 
markers; Lane 2: undigested PGR fragment; Lanes 3 and. 7 : 
genotype CT; and Lanes 4-6: genotype CC . A Restriction 
Fragment Length Polymorphism (RFLP) was discovered 
whereby the 63 0 bp PGR fragment from GG pigs was 
digested into a 450 bp product while the PGR fragment 
from the GT pigs was only partially digested, which 
indicates the presence of the T allele. 

Figure 3 depicts the concentrations of submaxillary 
salivary gland (SMG) A- 16 androstenes in boars of the CG 
versus the GT genotype. Five out of thirty of the CG 
boars exhibited SMG A-16 androstene concentrations 
greater than the recommended threshold level for 
identifying tainted carcasses (55 ug/g SMG) . All of 
boars carrying the T allele (n=20) were below the 
recommended threshold level for boar taint . 

Figure 4 is a table that shows the observed 
differences in various growth, carcass, and reproductive 
traits of CC versus CT boars. The greater weights of 
testes, submaxillary glands and bulbourethral glands, as 
well as higher concentrations of SMG A-16-androstenes , 
are all indications of higher boar taint in the CG 
boars. Surprisingly the CC boars also had 5.9% increase 
in rate of gain and longer carcasses as well. 
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Figure 5 shows the sequence of the bovine CYPllal 
gene, including 948 nucleotide of the 5' UTR. 

Figure 6 shows the sequence of the chicken CYPllal 
gene, including 137 nucleotide of the 5 'UTR. 

DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, materials 
and methods are provided for diagnosing genetic 
alterations in the CYPllal gene associated with aberrant 
or increased sterioid biosynthesis in livestock. In the 
mouse, polymorphic variation in CYPllal is responsible 
for genetic differences in testosterone production. In 
mouse, CYPllal maps to chromosome 9. This region is. 
syntenic with porcine chromosome 7. 

A principle cause of taint in the boar is the 
presence of the A-16 steroid, androstenone , which is one 
of many steroids . produced, in the boar testis.. 
Androstenone and androstenone metabolites such as 
androstenol are secreted by the testis and sequestered 
in the submaxillary salivary glands (SMG) . During 
mating behavior these steroids are. released into the air 
through the saliva and function as sexual pheromones 
whereby they induce estrous behavior in female pigs 
(sows) . Since A-15 steroids are highly lipophilic, 
androstenone is also stored in body fat, where its 
presence in high concentrations contributes to the off- 
flavors in pork known as boar taint. 

Concentrations of androstenone in the fat are 
highly heritable. A quantitative trait locus (QTL) has 
been identified for fat androstenone (microsatelli te 
marker SO102), which is located on porcine chromosome 7 
in the region of the swine leukocyte antigen complex 
(SLA) . In accordance with the present invention, a 
particular genetic polymorphic sequence has been 
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identified which is associated with androstenone 
production and boar taint. 

The presence of a quantitative trait locus (QTL) 
for fat androstenone on chromosome 7 in the pig suggests 
5 that porcine CYPllal may be located on chromosome 7 and, 

as the rate limiting enzyme in steroid synthesis may be 
an important control point for androsterone synthesis 
and the occurrence of boar taint . 

A genomic search was conducted to compare 2.4 kb of 

10 the untranslated region (5'UTR) of the porcine CYPllal 

gene from a preselected group of boars in order to 
determine if polymorphisms exist which are associated 
with compounds which cause boar taint. First, 
comparisons of the genotypes of five "high taint" and 

15 five "low taint" boars by direct sequencing of PGR 

products (using the ABI Prism 377 at the Nucleic Acid 
Facility, Penn State University Biotechnology Institute)' " 
revealed the presence of one single nucleotide 
polymorphism (SNP) in the entire 2.4 kb 5' UTR: This 

20 SNP (CT allele) was discovered only in boars that 

exhibited low concentrations of delta~16 steroids in the 
salivary gland, a measurement that is highly correlated 
with androstenone concentrations in the fat. This 
polymorphism consists of either a thymidine (T) or a 

25 cytosine (C) at position - 155 from the start site of 

translation. The polymorphism was located in a 
restriction enzyme recognition site such that the 
presence of the T allele would change the restriction 
fragment length pattern observed after digestion with 

30 specific restriction enzymes.- In this particular case, 

the restriction enzyme used was Sphl (New England 
Biolabs) . Additional restriction enzymes are available 
which are able to cut the same DNA sequence. Presence or 
absence of the T allele was determined by examination of 

35 restriction digests of CYPllal 5 'UTR using Sphl. 
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Presence of the T allele, either homozygous (TT) or 
heterozygous (CT) , was associated with low boar taint. 
Presence of the CC allele was associated with high boar 
taint, as well as with increased testis weight, 
bulbourethral gland length and weight and submaxillary 
salivary gland weights . In addition, boars that 
possessed the CC allele exhibited a 5.9% improvement in 
rate of gain as well as longer carcasses. 

The discovery that this polymorphism is associated 
with increased rate of gain and carcass length in 
addition to its effects on reproductive traits indicates 
that this polymorphism affects many other growth and 
developmental traits. Thus, presence or absence of this 
polymorphism may also be associated with feed efficiency 
and with birth weight. The association of this 
polymorphism with reproductive traits such as testis 
weight, bulbourethral gland length and weight, 
submaxillary gland weight, and A-16 steroid 
concentrations, are all indications of a general effect 
on gonadal steroid production. 

The data presented herein .indicate -that the 
presence or absence of the CYPllal polymorphism may' have 
effects on other reproductive traits such as ovulation 
rate, litter size, milk production, and fertility (both 
male and female) . Additionally, since the adrenal gland 
is another site where CYPllal is expressed to produce 
glucocorticoid steroids such Cortisol, this polymorphism 
may be associated with disease response traits since 
these traits are known to be modulated by adrenal 
steroids. 

In a further aspect of the invention, this genetic 
marker may also be used in combination with other 
genetic markers to produce favorable combinations of 
alleles or to select against those test subjects 
carrying unfavorable combinations . Examples of some of 
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these previously identified genes are: tumor necrosis 
factor alpha (TNFa) , CYPllal, prolactin (PRL) , estrogen 
receptor (ER) and prolactin receptor (PRIR) . Examples 
of some of these previously identified microsatellite 
markers are: S0064, S0102, S0078, S0158, S0066, SW304, 
SW1083, SOlOl, and S0212. 

Additional polymorphisms in the porcine CYPllal 
gene may be identified using the methods of the present 
invention. Such alterations may occur in the 
untranslated region of the gene but may also be 
identified in the translated region, as well as in the 
intronic and exonic sequences. It is likely that a 
subset of these changes will cause or be associated with 
changes in androgen function and phenotypic traits. 
Once such genetic alterations are identified, it is 
possible to introduce these or similar changes into the 
genome by known techniques in order to produce 
transgenic animals that possess a desired CYPllal 
genotype. The data further suggest that polymorphisms 
in homologous areas of CYPllal of other agriculturally 
important species are likely to cause or be associated 
with similar changes in function and phenotype. 

In a further aspect of the invention, the 
corresponding CYPllal sequences from the cow and the 
chicken are provided. This information facilitates 
genomic scanning of the 5'UTR of the bovine or chicken 
CYPllal to reveal polymorphisms that are associated with 
growth, carcass traits, and reproduction (including milk 
production and egg production) . 

DIAGNOSTIC KITS FOR PRACTICING 
THE METHODS OF THE INVENTION 

The present invention also includes kits for the 
practice of the methods of the invention. The kits 
comprise a vial, tube, or any other container which 
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contains one or more oligonucleotides, which hybridizes 
to a DNA segment which DNA segment which is or is linked 
to the CYPllal gene. Some kits contain, two such . 
oligonucleotides, which serve as primers to amplify a 
segment of chromosome DNA. The segment selected for 
amplification can be a CYPllal gene that includes a site 
at which a variation is known to occur. Some kits 
contain a pair of oligonucleotides for detecting 
precharacterized variations. For example, some kits 
contain oligonucleotides suitable for allele-specif ic 
oligonucleotide hybridization, or allele-specif ic 
amplification hybridization. The kits of the invention 
may also contain components of the amplification system, 
including PGR reaction materials such as buffers and a 
thermostable polymerase. In other embodiments, the kit 
of the present invention can be used in conjunction with 
commercially available amplification kits, such as may 
be obtained from GIBCO BRL (Gaithersburg, Md.) 
Stratagene (La Jolla, Calif.), Invitrogen (San Diego, 
Calif.), Schleicher & Schuell (Keene, N.H.), Boehringer 
Mannheim (Indianapolis, Ind. ) . The kits may optionally . 
include positive or negative control reactions or • 
markers, molecular weight size markers for gel 
el ectrophoresis , and the like. The kits usually include 
labeling or instructions indicating the suitability of 
the kits for diagnosing steroid biosynthesis alterations 
and indicating how the oligonucleotides are to be used 
for that purpose. The term "label" is used generically 
to encompass any written or recorded material that is 
attached to, or otherwise accompanies the diagnostic at 
any time during its manufacture, transport, sale or use. 

MODES OF PRACTICING THE INVENTION 

1 . Linkage Analysis 

Determining linkage between a polymorphic marker 
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and a locus associated with a particular phenotype is 
performed by mapping polymorphic markers and observing ■ ■ 
whether they co- segregate with the high taint phenotype - 
(for example) on a chromosome in an informative meiosis. 
See, e.g., Kerem et al.. Science 245:1073-1080 (1989); 
Monaco et al . , Nature 316:842 (1985); Yamoka et al , , 
Neurology 40:222-226 (1990), and as reviewed in Rossiter 
et al., FASEB Journal 5:21-27 (1991). A single pedigree 
rarely contains enough informative meioses to provide 
definitive linkage, because families are often small and 
markers may be not sufficiently informative. For 
example, a marker may not be polymorphic in a particular 
family. 

Linkage may be established by an affected sib-pairs 
analysis as described in Terwilliger & Ott, Handbook of 
Human Genetic Linkage (Johns Hopkins, Md. , 1994), Ch. 
26. This approach requires no assumptions to be' made 
concerning penetrance or variant frequency, but only 

takes into account the data of a relatively small 

proportion (i.e., the SIB pairs) of all the family 
members whose phenotype and polymorphic markers- have 
been determined. Specifically, the affected SIB pairs 
analysis scores each pair of affected SIRS as sharing 
(concordant) or not sharing (discordant) the same 
allelic variant of each polymorphic marker. For each 
marker, a probability is then calculated that the 
observed ratio of concordant to discordant SIB pairs 
would arise without linkage of the marker. 

As described in Thompson & Thompson, Genetics in 
Medicine, 5th ed, 1991, W.B. Saunders Company, 
Philadelphia, in linkage analysis, one calculates a 
series of likelihood ratios (relative odds) at various 
possible values of 0, ranging from 6 =0.0 (no 
recombination) to 6 =0.50 (random assortment). Thus, the 
likelihood ratio at a given value of 0 is (likelihood of 
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data if aloci are linked at 0 )/{ likelihood of data if 
loci are unlinked) . Evidence in support of linkage is 
usually expressed as the logn, of this ratio and called, a 
"lod score" for "logarithm of the odds." For example, a 
lod score of 5 indicates 100,000:1 odds that the linkage 
being observed did not occur by chance. 

The use of logarithms allows data collected from 
different families to be combined by simple addition. 
Computer programs are available for the calculation of 
lod scores for differing values of 6. Available programs 
include LIPED, and MLINK (Lathrop, Proc. Nat. Acad. Sci . 
81:3443-3446 (1984). 

For any particular lod score, a recombination 
fraction may be determined from mathematical tables. See 
Smith et al . , Mathematical tables for research workers 
in human genetics (Churchill, London, 1961) and Smith, 
Ann. Hum. Genet. 32:127-150 (1968). The value of 0 at 
which the lod score is the highest is considered to be 
the best estimate of -the recombination fraction, the 
"maximxim likelihood estimate" . 

Positive lod score values suggest that the two loci 
are linked, whereas negative. values suggest that .linkage 
is less likely (at that value of 0) than the possibility 
that the two loci are unlinked. By convention, a 
combined lod score of +3 or greater (equivalent to 
greater than 1000:1 odds in favor of linkage) is 
considered definitive evidence that two loci are linked. 
Similarly, by convention, a negative lod score of -2 or 
less is taken as definitive evidence against linkage of 
the two loci being compared. If there are sufficient 
negative linkage data, a locus can be excluded from an 
entire chromosome, or a portion thereof, a process 
referred to as exclusion mapping. The search is then 
focused oh the remaining non-exclxided chromosomal 
locations. For a general discussion of lod scores and 
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15 



linkage analysis, see, e.g., T. Strachan, Chapter 4, 
"Mapping the human genome" in The Human Genome, 1992 
BIOS Scientific Publishers Ltd. Oxford. 

The data can also be subjected to haplotype 
5 analysis. This analysis assigns allelic markers between 

the chromosomes of. an individual such that the number of 
recombinational events needed to account for segregation 
between generations is minimized. Linkage may also be 
established by determining the relative likelihood of 
10 obtaining observed segregation data for any two markers 

when the two markers are located at a recombination 
fraction 6, versus the situation in which the two 
markers are not linked, and thus segregating 
independently. 



2- Isolation and Amplification of DNA 



Samples of , patient,, proband, test subject, or 
family member genomic DNA are isolated from any 

20 convenient source including saliva, buccal cells, hair 

roots, blood, cord blood, amniotic fluid, interstitial . 
fluid, peritoneal fluid, chorionic villus, and any other 
suitable cell or tissue sample with intact interphase 
nuclei or metaphase cells. The cells can be obtained 

25 from solid tissue as from a fresh or preserved organ or 

from a tissue sample or biopsy. The sample can contain 
compounds which are not naturally intermixed with the 
biological material such as preservatives, 
anticoagulants, buffers, fixatives, nutrients, 

30 antibiotics, or the like. 

Methods for isolation of genomic DNA from these 
various sources are described in, for example, Kirby, 
DNA Fingerprinting, An Introduction, W.H. Freeman & Co. 
New York (1992) . Genomic DNA can also be isolated from 

35 cultured primary or secondary cell cultures or from 
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transformed cell lines derived from any of the 
aforementioned tissue samples. 

Samples of patient, proband/ test subject or family 
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member RNA can also be used. RNA can be isolated from 
tissues expressing the CYPllal gene as described in 
Sambrook et al., supra. RNA can be total cellular RNA, 
mRNA, poly A+ RNA, or any combination thereof. For best 
results, the RNA is purified, but can also be unpurified 
cytoplasmic RNA. RNA can be reverse transcribed to form 
DNA which is then used as the amplification template, 
such that the PGR indirectly amplifies a specific 
population of RNA transcripts. See, e.g.., Sambrook, 
supra, Kawasaki et al . , Chapter 8 in PGR Technology, 

(1992) supra, and Berg et al . , Hum. Genet. 85:655-658 

(1990) . 

3. PGR Amplification 

The most common means for amplification is 
polymerase chain reaction (PGR), as described in U.S. 
Pat. Nos. 4,683,195, 4,683,202, 4,965,188 each of which 
is hereby incorporated by reference. .. If PGR is used to : 
amplify the target regions in blood cells, heparinized 
whole blood should be drawn in a sealed vacuum tube kept 
separated from other samples and handled with clean 
gloves. For best results, blood should be processed 
immediately after collection; if this is impossible, it 
should be kept in a sealed container at 4° C until use. 
Gells in other physiological fluids may also be assayed . 
When using any of these fluids, the cells in the fluid 
should be separated from the fluid component by 
centrif ugation. 

Tissues should be roughly minced using a sterile, 
disposable scalpel and a sterile needle (or two 
scalpels) in a 5 mm Petri dish. Procedures for removing 
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paraffin from tissue sections are described in *a variety 
of specialized handbooks well known to those skilled in 
the art . 

To amplify a target nucleic acid sequence in a 
sample by PGR, the sequence must be accessible to the 
components of. the amplification system. One method of 
isolating target DNA is crude extraction which is useful 
for relatively large samples. Briefly, mononuclear cells 
from samples of blood, amniocytes from amniotic fluid, 
cultured chorionic villus cells, or the like are 
isolated by layering on sterile Ficoll-Hypaque gradient 
by standard procedures. Interphase cells are collected 
and washed three times in sterile phosphate buffered 
saline before DNA extraction. If testing DNA from 
peripheral blood lymphocytes, an osmotic shock 
(treatment of the pellet for 10 sec with distilled 
water) is suggested, followed by two additional washings 
if residual red blood cells are visible following the 
initial washes. This will prevent the inhibitory effect 
of the heme group carried by hemoglobin on the PGR 
reaction. If PGR testing is not performed iitimediately 
after sample collection, aliquots of 10^ ^ells- can- -be 
pelleted in sterile Eppendorf tubes and the dry pellet 
frozen at -20*" G until use. 

The cells are resuspended (10^ nucleated cells per 
100 uD in a buffer of 50 mM Tris-HCl (pH 8.3), 50 mM 
KGl 1.5 mM MgGl2, 0.5% Tween 20, 0.5% NP40 supplemented 
with 100 ug/ml of proteinase K. After incubating at 56° 
C for 2 hr, the cells are heated to 95° G for 10 min to 
inactivate the proteinase K and immediately moved to wet 
ice (snap-cool) . If gross aggregates are present, 
another cycle of digestion in the same buffer should be 
undertaken. Ten \il of this extract is used for 
amplification. 



When extracting DNA from tissues, e.g., chorionic 
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villus cells or confluent cultured cells, the amount of 
the above mentioned buffer with proteinase K may vary 



is incubated for 4-10 hrs at 50°-60° C and then at 95° C 
for 10 minutes to inactivate the proteinase. During 
longer incubations, fresh proteinase K should be added 
after about 4 hr at the original concentration. 

When the 'sample contains a small number of cells, 
extraction may be accomplished by methods as described 
in Higuchi, "Simple and Rapid Preparation of Samples for 
PGR", in PGR Technology, Ehrlich, H. A. (ed.), Stockton 
Press, New York, which is incorporated herein by 
reference. PGR can be employed to amplify target regions 
from chromosome 7 in very small numbers of cells (1000- 
5000) derived from individual colonies from bone marrow 
and peripheral blood cultures. The cells in the sample 
are suspended in 20 ul of PGR lysis buffer (10 mM Tris- 
HGl (pH 8.3), 50 itiM KGl, 2.5 mM MgClj, 0.1 mg/ml gelatin, 
0.45% NP40, 0.45% Tween 20) and frozen until use. When 
PGR is to be performed, 0.6 yl of proteinase K (2 mg/ml) 
is added to the cells in the PGR ly3is buffer. The . 
sample is then heated to about 6&° G and incub'ated for 1 
hr. Digestion is stopped through inactivation of the " 
proteinase K by heating the samples to 9 5"* C for 10 min 
and then cooling on ice . 

A relatively easy procedure for extracting DNA for 
PGR is a salting out procedure adapted from the method 
described by Miller et al . , Nucleic Acids Res. 16:1215 
(1988) , which is incorporated herein by reference. 
Mononuclear cells are separated on a Ficoll-Hypaque 
gradient. The cells are resuspended in 3 ml of lysis 
buffer (10 mM Tris-HGl, 400 mM NaGl, 2 mM Naa EDTA, pH 
8.2). Fifty \il of a 20 mg/ml solution of proteinase K 
and 150 111 of a 20% SDS solution are added to the cells 
and then incubated at 37° G overnight. Rocking the tubes 



according to the size of the tissue sample. The extract 
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during incubation will improve the digestion of the 
sample. If the proteinase K digestion is incomplete 
after overnight incubation (fragments are still 
visible) ; an additional 50 ul of the 20 mg/ml proteinase 
K solution is mixed in the solution and incubated for 
another night at 37° C on a gently rocking or rotating 
platform. Following adequate digestion, one ml of a 6M 
NaCl solution is added to the sample and vigorously 
mixed. The resulting solution is centrifuged for 15 
minutes at 3000 rpm. The pellet contains the 
precipitated cellular proteins, while the supernatant 
contains the DNA. The supernatant is removed to a 15 ml 
tube that contains 4 ml of isopropanol. The contents, of 
the tube are mixed gently until the water and the 
alcohol phases have mixed and a white DNA precipitate 
has formed. The DNA precipitate is removed and dipped in 
a solution of 7 0% ethanol and gently mixed. The DNA 
precipitate is removed from the ethanol and air-dried. 
The precipitate is placed in distilled water and 
dissolved. 

Kits for the extraction of high-molecular weight 
DNA for PCR include a Genomic Isolation Kit - A. S . A . P . - 
(Boehringer Mannheim, Indianapolis, Ind.), Genomic DNA 
Isolation System (GIBCO BRL, Gaithersburg, Md. ) , Elu- 
Quik DNA Purification Kit (Schleicher & Schuell, Keene, 
N.H.), DNA Extraction Kit (Stratagene, La Jolla, 
Calif.), TurboGen Isolation Kit (Invitrogen, San Diego, 
Calif.), and the like. Use of these kits according to 
the manufacturer's instructions is generally acceptable 
for purification of DNA prior to practicing the methods 
of the present invention. 

The concentration and purity of the extracted DNA 
can be determined by spectrophotometric analysis of the 
absorbance of a diluted aliquot at 260 nm and 280 nih. 
After extraction of the DNA, PCR amplification may 
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proceed. The first step of each cycle of the PGR 
involves the separation of the nucleic acid duplex 
formed by the primer extension. Once the strands are 
separated, the next step in PGR involves hybridizing the 
separated strands with primers that flank the target 
sequence. The primers are then extended to form 
complementary copies of the target strands. For 
successful PGR amplification, the primers are designed 
so that the position at which each primer hybridizes 
along a duplex sequence is such that an extension 
product synthesized from one primer, when separated from 
the template (complement), serves as a template for the 
extension of the other primer. The cycle of 
denaturation, hybridization, and extension is repeated 
as many times as necessary to obtain the desired amount 
of amplified nucleic acid. 

In a particularly useful embodiment of PGR 
amplification, strand separation is achieved by heating 
the reaction to a sufficiently high temperature for an 
sufficient time to cause the denaturation of the duplex- 
but not to cause an irreversible-- denaturation of the 
polymerase (see U.S. Pat. No.' 4 , 965 , 188 , incorporated 
herein by reference) . Typical heat denaturation involves 
temperatures ranging from about 80'' C to 105° C for 
times ranging from seconds to minutes. Strand 
separation, however, can be accomplished by any suitable 
denaturing method including physical, chemical, or 
enzymatic means. Strand separation may be induced by a 
helicase, for example, or an enzyme capable of 
exhibiting helicase activity. For example, the enzyme 
RecA has helicase activity in the presence of ATP. The 
reaction conditions suitable for strand separation by 
helicases are known in the art (see Kuhn Hoffman- 
Berling, 1978, CSH-Quantitative Biology, 43:63-^67; and 
Radding, 1982, Ann. Rev. Genetics 16:405-436, each of 
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which is incorporated herein by reference). 

Template-dependent extension of primers in PGR is 
catalyzed by a polymerizing agent in the presence of 
adequate amounts of four deoxyribonucleotide 
5 triphosphates (typically dATP, dGTP, dCTP, and dTTP) in 

a reaction medium comprised of the appropriate salts, 
metal cations, and pH buffering systems. Suitable 
polymerizing agents are enzymes known to catalyze 
template-dependent DNA synthesis. In some cases, 

10 the target regions may encode at least a portion of a 

protein expressed by the cell. In this instance, mRNA 
may be used for amplification of the target region. 
Alternatively, PGR can be used to generate a cDNA 
library from RNA for further amplification, the initial 

15 template for primer extension is RNA. Polymerizing 

agents suitable for synthesizing a complementary, copy- 
DNA (cDNA) sequence from the RNA template "are reverse 
transcriptase (RT) , such as avian myeloblastosis virus 
RT, Moloney murine leukemia virus RT, or Thermus 

20 thermophilus (Tth) DNA polymerase, a thermostable DNA 

polymerase with reverse transcriptase activity marketed 
by Perkin Elmer Cetus, Inc. Typically, the genomic RNA* 
template is heat degraded during the first denaturation 
step after the initial reverse transcription step 

25 leaving only DNA template. Suitable polymerases for use 

with a DNA template include, for example, E. coli DNA 
polymerase I or its Klenow fragment, T4 DNA polymerase, 
Tth polymerase, and Taq polymerase, a heat- stable DNA 
polymerase isolated from Thermus aquaticus and 

30 commercially available from Perkin Elmer Cetus, Inc. The 

latter enzyme is widely used in the amplification and 
sequencing of nucleic acids. The reaction conditions for 
using Taq polymerase are known in the art and are 
described in Gelfand, i989, PGR Technology, supra. 

35 
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Allele-specif ic PGR differentiates between 
chromosome 7 target regions differing in the presence or 
absence of a variation or polymorphism. PGR 
amplification primers are chosen which bind only to 
certain alleles of the target sequence. Thus, for 
example, amplification products are generated from those 
chromosome 7 sets which contain the primer binding 
sequence, and no amplification products are generated in 
chromosome 7 sets without the primer binding sequence. 
This method is described by Gibbs, Nucleic Acid Res. 
17 : 12427-2448 (1989) . 
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Further diagnostic screening methods employ the 
allele-specif ic oiigonucleotide (ASO) screening methods, 
as described by Saiki et al . , Nature 324 :163-166 (1986). 
Oligonucleotides with one or more base pair mismatches 
are generated for any particulai^ allele.. ASO screening - 
methods detect mismatches between variant target genomic 
or PGR amplified DNA and non-mutant oligonucleotides, 
showing decreased binding of the oligonucleotide 
relative to a mutant oligonucleotide. Oligonucleotide 
probes can be designed that under low stringency will 
bind to both polymorphic forms of the allele, but which 
at higher stringency, bind to the allele to which they 
correspond. Alternatively, stringency conditions can be 
devised in which an essentially binary response is 
obtained, i.e., an ASO corresponding to a variant form 
of the GYPllal gene will hybridize to that allele, and 
not to the wildtype allele. 



28 




wo 00/69882 



PCT/USOO/13168 



6. Ligase Mediated Allele Detection Method 

Target regions of a test Subject's DNA can be 
compared with target regions in unaffected and affected 
family members by ligase-mediated allele detection. See 
Landegren et al . , Science 241 : 1077-1080 (1988). Ligase 
may also be used to detect point mutations in the 
ligation amplification reaction described in Wu et al . , 
Genomics 4:560-569 (1989). The ligation amplification 
reaction (LAR) utilizes amplification of specific DNA 
sequence using sequential rounds of template dependent 
ligation as described in Wu, supra, and Barany, Proc. 
Nat. Acad. Sci . 88:189-193 (1990). 

7. Denaturing Gradient Gel Electrophoresis 

Amplification products generated using the 
polymerase chain reaction can be analyzed by the use of 
denaturing gradient gel electrophoresis . Different 
alleles can be identified based on the different 
sequence-dependent melting properties - and , 
electrophoretic migration of DNA in solution, DNA' 
molecules melt in segments, termed melting domains, 
under conditions of increased temperature or 
denaturation. Each melting domain melts cooperatively at 
a distinct, base-specific melting temperature (Tm) . 
Melting domains are at least 20 base pairs in length, 
and may be up to several hundred base pairs in length. 

Differentiation between alleles based on sequence 
specific melting domain differences can be assessed 
using polyacrylamide gel electrophoresis, as described 
in Chapter 7 of Erlich, ed. , PGR Technology, Principles 
and Applications for DNA Amplification, W.H. Freeman and 
Co, New York (1992), the contents of which are hereby 
incorporated by reference . 
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Generally, a target region to be analyzed by 
denaturing gradient gel electrophoresis is amplified 
using PGR primers flanking the. target region. The 
amplified PGR product is applied to a polyacrylamide gel 
with a linear denaturing gradient as described in Myers 
et al., Meth, Enzymol. 155:501-527 (1986), and Myers et 
al . , in Genomic Analysis, A Practical Approach, K. 
Davies Ed. IRL Press Limited, Oxford, pp. 95-139 (1988), 
the contents of which are hereby incorporated by 
reference. The electrophoresis system is maintained at a 
temperature slightly below the Tm of the melting domains 
of the target sequences. 

In an alternative method of denaturing gradient gel 
electrophoresis, the target sec[uences may be initially 
attached to a stretch of GG nucleotides, termed a GC 
clamp, as described in Chapter 7 of Erlich, supra, 
Preferably, at least 80% of the nucleotides in the GC 
clamp are either guanine or cytosine. Preferably, the GC 
clamp is at least 3 0 bases long. This method is 
particularly suited to target sequences with- high Tm*s. 

Generally, the target region is amplified by the 
polymerase chain reaction as described above.' One of the 
oligonucleotide PGR primers carries at its 5' end, the 
GG clamp region, at least 30 bases of the GG rich 
sequence, which is incorporated into the 5' end of the 
target region during amplification. The resulting 
amplified target region is run on an electrophoresis gel 
under denaturing gradient conditions as described above. 
DNA fragments differing by a single base change will 
migrate through the gel to different positions, which 
may be visualized by ethidium bromide staining. 

8 . Temperature Gradient Gel Electrophoresis 

Temperature gradient gel electrophoresis (TGGE)is 
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based on the same underlying principles as denaturing 
gradient gel electrophoresis, except the denaturing 
gradient is produced by differences in temperature 
instead of differences in the concentration of a . 
chemical denaturant. Standard TGGE utilizes an 
electrophoresis apparatus with a temperature gradient 
running along the electrophoresis path. As samples 
migrate through a gel with a unifonn concentration of a 
chemical denaturant, they encounter increasing 
temperatures. An alternative method of TGGE, temporal 
temperature gradient gel electrophoresis (TTGE or tTGGE) 
uses a steadily increasing temperature of the entire 
electrophoresis gel to achieve the same result- As the 
samples migrate through the gel the temperature of the 
entire gel increases, leading the samples to encounter 
increasing temperature as they migrate through the gel. 
Preparation of samples, including PGR amplification with 
incorporation of a GC clamp, and visualization of 
products are the same as for denaturing gradient gel 
electrophoresis . 

9. Single-Strand Conformation Polymorphism AriaLly's is 

Target sequences or alleles at the CYPllal locus 
can be differentiated using single-strand conformation 
polymorphism analysis, which identifies base differences 
by alteration in electrophoretic migration of single 
stranded PGR products, as described in Orita et al., 
Proc. Nat. Acad. Sci. 86:2766-2770 (1989). Amplified PGR 
products can be generated as described above, and heated 
or otherwise denatured, to form single stranded 
amplification products. Single-stranded nucleic acids 
may refold or form secondary structures which are 
partially dependent on the base sequence. Thus, 
electrophoretic mobility of single-stranded 
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amplification products can detect base-sequence 
difference between alleles or target sequence's. 

10. Chemical or Enzymatic Cleavage of Mismatches 

Differences between target sequences can also be 
detected by differential chemical cleavage of mismatched 
base pairs, as described in Grompe et al . , Am. J. Hum. 
Genet. 48:212-222 (1991). In another method, differences 
between target sequences can be detected by enzymatic 
cleavage of mismatched base pairs, as described in 
Nelson et al . , Nature Genetics 4:11-18 (1993) . Briefly, 
genetic material from a patient and an affected family 
member may be used to generate mismatch free 
heterohybrid DNA duplexes. As used herein, 
"heterohybrid" means a DNA duplex strand comprising one 
strand of DNA from one person, usually the patient, and 
a second DNA strand from another person, usually an ' ' 
affected or unaffected family member. Positive selection 
for heterohybrids free of mismatches allows 
determination of small insertions, deletions or other 
polymorphisms that may be associated with alterations in 
androgen metabolism. ' . . . . . : ' - [■ . . 

11. Non-PCR Based DNA Diagnostics 

The identification of a DNA sequence linked to 
CYPllal can be made without an amplification step, based 
on polymorphisms including restriction fragment length 
polymorphisms in a patient and a family member. 
Hybridization probes are generally oligonucleotides 
which bind through complementary base, pairing to all or 
part of a target nucleic acid. Probes typically bind 
target sequences lacking complete complementarity with 
the probe sequence depending on the stringency of the 
hybridization conditions. The probes are preferably 
labeled directly or indirectly, such that by assaying 
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for the presence or absence of the probe, one can detect 
the presence or absence of the target sequence. Direct 



with or ^^S. Indirect labeling methods include 
fluorescent tags, biotin complexes which may be bound to 
avidin or streptavidin, or peptide or protein tags. 
Visual detection methods include photoluminescents , 



and 3, 3', 5, 5 ' - tetramethylbenzidine (TMB) , 
fluorescein, and its derivatives, dansyl, umbellif erone 
and the like or with horse radish peroxidase, alkaline 
phosphatase and the like. 

Hybridization probes include any nucleotide . 
sequence capable of hybridizing- to the porcine 
chromosome where CYPllal resides, and thus defining a 
genetic marker linked to CYPllal, including a 
restriction fragment length polymorphism, a 
hypervariable region, repetitive element, or a variable 
number tandem repeat. Hybridization probes can be any 
gene or a suitable analog. Further suitable 
hybridization probes include- exon fragme^rts, or porti'ons' - 
of cDNAs or genes known to-- map to - the -re-levant region- of .:;- 
the chromosome . . 

Preferred tandem repeat hybridization probes for 
use according to the present invention are those that 
recognize a small number of fragments at a specific 
locus at high stringency hybridization conditions, or 
that recognize a larger nxamber of fragments at that 
locus when the stringency conditions are lowered. 

The following examples are provided to illustrate 
embodiments of the present invention. They are not 
intended to limit the invention in any way. 



labeling methods include radioisotope labeling, siich as 



Texas red, rhodamine and its derivatives, red leuco dye 
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EXAMPLE I 

A 6eziet:lc Marker £or Meat: Quality, Growth, Carcass and 
Reproductive Traits in Pigs 

5 In accordance with the present invention, a genetic 

marker has been identified and characterized which is 
associated with improved meat quality and improved 
growth and carcass traits in pigs. The following 
materials and methods were utilized in the practice of 

10 Example I. 

Testis tissue samples were obtained from fifty- 
Yorkshire boars for which growth, carcass, and boar 
taint data had previously been collected. Boars were 
weaned at approximately 10 weeks of age, assigned to 

15 pens, and fed a standard grower-finisher diet to a final 

weight of approximately 12 0 kg. Boars were killed by 
electrical stunning and exsanguination at the Penn State 
University meats Laboratory. Testes, bulbourethral 
glands and submaxillary salivary glands were collected, 

20 trimmed, and weighed. Carcasses were weighed and then 

chilled overnight. The following day data were collected 
for standard carcass measurements^ such as carcass 
length, loin eye area, fat depth and marbling. 

The assay for submaxillary salivary gland delta-16- 

25 androstenes was adapted from a procedure developed by 

Squires (J. Animal Sci . 69: 1092-1100, 1991). Briefly, 
submaxillary salivary glands were trimmed and minced in 
a food processor (Cusinart) and one gram of minced 
tissue was placed in a 50 ml test tube. Methanol (5 ml) 

30 was added and the mixture was homogenized for 3 0 sec by 

Polytron. Samples were placed in a centrifuge for 5 min 
@ 2 800 rpm. Three ml of distilled water were added to 3 
ml of the supernatant and mixed by vortexing. six ml of 
hexane were added to extract the delta-16-androstenes . 

35 The mixture was vortexed and allowed to stand for 5 min 

34 
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for the phases to separate. Three milliliters of the 
organic phase were transferred to a glass culture tube 
and the extract was dried under nitrogen while in" a 
water bath (30°C) . Color reagents were added (.5 ml of 
5 .5% resorcylaldehyde in glacial acetic acid plus .5 ml 

of 5% sulfuric acid in glacial acetic acid) . The tubes 
were placed in a heat block for 10 min at 95 C. 
Development of a violet color, an index of the presence 
of delta-16-androstenes , was measured by pipetting 100 

10 pi of the test solution into a well in a 96-well 

microplate. Absorbance was measured at several 
wavelengths near the known absorbance maximum for A16- 
androstenes (593 nm) and compared against standard test 
solutions containing 5a-androst-16-erie-3 3-ol * (the 

15 predominant A16-androstene in the submaxillary *salivary 

gland) . Concentration of A16-androstenes was e&tablished 
by generation of a standard curve with the" standaircl ;test 
solutions. 

Data were analyzed by ANOVA using the GLM., 

20 procedures of SAS (1-990) . 

Testis tissue samples were obtal tied from stpraige 
{-20'*C) for ten boars: five that ' had 6he highest' 
concentrations of Al6-androstenes (high boar ta'int) and 
five that had the lowest concentrations of A15- 

25 androstenes {low boar taint) . DNA was extracted by 

Proteinase K digestion. Approximately 50 mg of testis 
tissue was wrapped in aluminum foil and frozen in liquid 
nitrogen. The sample was then pulverized and 
approximately 2 0 mg was placed in a micro fuge tube with 

30 .5 ml digestion buffer (50 mM Tris, pH 8.5; ImM EDTA; 

0.5% Tween 20; 2 00 pg/ml proteinase K (Gibco Life 
Technologlies, Grand Island, NY) . Proteinase K was 
stored at -20°C in stock solution (20 mg/ml proteinase 
K; 1-mM Tris-HCl, pH 7 . 5 ; 20 mM calcium chloride, and 5% 

35 glycerol) . The samples were suspended in digestion 
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buffer and placed in a water bath @ 55 "^C for 3 hours - 
Scimples were centrifuged for 1 min-@13,600 g and placed, 
in a heat block for 10 min @ 95 °C. Samples were removed 
and stored at -20 "^C until analyzed. 
5 Four sets of primers were obtained which 

corresponded to. approximately 600 bp each for a total of 
approximately 2.4kb of the 5'UTR of the porcine CYPllal 
gene (sequence obtained from Urban, et al., J. Biol. 
Chem. 269:25761-25769, 1994). See Figure 1. Polymerase 
10 Chain Reactions were initiated for each primer set for 

each of the ten DNA templates . PGR was performed as 
follows . 

1. 2 min @ 94 C. 

2 . 1 min @ 94 C - " ' ' 
15 3.1 min @ 55 C 

4. 1 min @ 72 C 

5. 3 5 cycles to (2.) 

6. 2 min @ 72 C . . 

7 . hold at 5 C - 

20 

Reactions were performed using lOx -buffer (W/MgClsO ; 
dNTP • s (10 nmol ) ; primer CYPsdc- - For 1 ' (2 0' ^ pmoi )' ; primer - ^ ~ 
CYPscc Revl (20 pmol) ; Taq polymerase ;ddH20 and DNA 
template (1:10 dilution of Proteinase K digested sample, 

25 approximately 100 ng) . 

PGR products were analyzed by agarose gel 
electrophoresis, and the -600 bp bands cut out of the 
agarose gel and purified using the QIAquick gel 
extraction kits (QIAGEN Inc., Valencia GA) . The 

30 nucleotide sequences of each of the forty PGR products 

was determined in both forward and reverse directions 
using an ABI Prism Model 377 Sequencer (Perkin Elmer, 
GA) at the Penn State Nucleic Acid Facility, PSU 
Biotechnology Institute . 
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The sequences of the PGR products were aligned * ' 
manually and examined for differences between the ten 
animals. While there were 37- differences ih-the samples 
when compared with the published sequence (Urban et al . , 
5 1994, supra), there was only one base pair that varied 

among this grpup of animals. At position -155 (155 bases 
before the start site ATG codon) , six. of the samples had 
the cytosine (CC) , and four were polymorphic; that is 
they had both the cytosine and thymidine (CT) , 

10 indicating heterozygosity at that base pair. Of 

significant interest was' that all -five*- of the high taint 
boar samples were the CC genotype, whereas four out of 

five of the low taint boar samples had the CT genotype 

This polymorphism was located in a restriction 

15 enzyme recognition site such that the presence of the T 

allele would change the restriction fragment length 
pattern observed after digestion with specific 
restriction enzymes. In; this particular case, the 
restriction enzyme used was SphI (New England Bialabs) . 

20 Presence or absence of the T allele in. the DNA samples 

from the full group of fifty- -boars, was.j determined-^^by ^ ---^ 
Restriction Fragment Length Polymorphism-- a^na lysis - • 
involving examination of restriction digests of CYPlla"! 
5'UTR using SphI For exemplary gel, see Figure 2. 

25 Presence of the T allele, either homozygous (TT) or 

heterozygous (CT) was associated with low boar taint. 
Presence of the CC allele was associated with high boar 
taint/ as well as with increased testis weight, 
increased bulbolurethral gland length and weight, and 

30 increased submaxillary salivary gland weight. See 

Figure 3 and Table 4. In addition, boars that possessed 
the CC allele exhibited a 5.9% improvement in rate of 
gain, and had greater amounts of lean muscle as 
evidenced by longer carcasses, and tended to have less 

35 fat as determined by backfat depth measurements. Boars 
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with the CC allele also tended to have higher 
concentrations of serum testosterone in blood samples 
taken at slaughter. - . • . 

A retrospective analysis of production records of 
5 direct female relatives (dams and siblings) of these 

boars revealed that those females related to boars 
possessing the T allele tended to have slightly larger 
litter sizes (+.31 pigs/litter) and weaned heavier 
litters (+4.27 kg). Thus this polymorphism appears to 
10 confer beneficial fertility and productivity traits to 

female pigs . 

EXAMPLE II 

A Genetic Marker for Meat Quality, Growth, Carcass and 
15 Reproductive Traits in Cows and Chickens 



The identification and characterization of the 
CYPllal polymorphism in pigs facilitates the 
characterization of the corresponding polymorphism in 

20 bovines which are associated with improved reproductive 

and carcass traits. The bovine CYPllal sequence is 
provided in Figure 5 . A suitable primer set for 
amplifying the bovine homologue of the 5' UTR for the 
CYPllal gene has the following sequences: Sense: 

25 5 ' -GCAGATGTCCCTGGTGATTC-3 ' ; and Antisense: 

5 ' -TGAACGGAGGGGAAGCC-3 ' . 

Amplified bovine CYPllal sequences and 
corresponding genetic traits are then characterized as 
set forth herein for the porcine CYPllal gene. 

30 Figure 6 depicts the CYPllal gene from chicken. In 

order to assess genetic changes in a more lengthy 5 'UTR 
sequence from the chicken CYPllal sequence provided in 
Genbank, the cDNA sequence provided in Figure 6 is 
utilized as the basis for 5' rapid amplification of cDNA 

35 ends (RACE) using a kit from Clontech containing RACE- 
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ready cDNA prepared from chicken. Clones obtained from 
this RACE approach yield 5' end points of the chicken 
CYPllal sequence for further analysis df genetic changes 
in the 5'UTR associated with improved reproductive and 
5 carcass traits. Genetic polymorphisms and alterations 

so identified are within the scope of the present 
invention. ISuitable protocols for practicing RACE are 
provided in Current Protocols of Molecular Biolocp/ , J. 
Wiley & Sons, Inc. 1998, Chapter 15.6.9, the entire 
10 disclosure of which is incorporated by reference herein. 

The present invention is not limited to the 
embodiments specifically described above, but is capable 
of variation and modification without departure from* the 
15 scope of the appended claims . 
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